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IMAGING SYSTEM AND BUSINESS METHODOLODY 

BACKGROUND 

Field of the Invention 

[0001] The present invention relates generally to document imaging and 
processing and more particularly to systems and methods for digitizing 
documents and storing and accessing the same. 

Background of the Invention 

[0002] Even with the widespread use of computers in business and in 
daily life, the use of paper-based documents to record, communicate and store 
information remains exceedingly popular. Although software applications 
offer new and improved functions such as character recognition, managed 
document archival and retrieval and specialized image processing, many 
businesses can not leverage these capabilities because they maintain a 
significant amount of information in paper form rather than electronically. 
[0003] Various other drawbacks are associated with business processes 
that involve storing large amounts of information in paper form as opposed to 
maintaining such information electronically. For example, pages can easily be 
lost or misplaced, large physical spaces may be required for storing the 
documents, and information may not be readily accessed through search 
applications which are available for electronically stored information. 
[0004] In some contexts, even though information was originally created 
and stored using paper documents, conversion to electronic format via 
digitization is required for one or more reasons. For example, in the case of 
litigation, it is often necessary to store, access, produce and analyze a large 
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number of documents associated with the particular dispute. In most cases, 
the overall business process associated with converting physical documents in 
various formats into digital form is error prone, costly and time-consuming. 
[0005] Various aspects of the overall task further complicate the process. 
For example, "paper-based" documents really represent many forms of 
physically stored information. This includes formats such as paper, microfilm 
and microfiche as well as other formats. Each of these formats generally 
requires its own, separate scanning device. Because of this, boxes of 
documents must be separated and fed into different scanning devices thus 
giving rise to the possibility that documents could be misplaced and/or the 
original document ordering could be lost. 

[0006] Difficulties in maintaining document integrity and the original 
ordering also arise during other steps in the digitization business process. 
Boxes of documents and/or individual documents may be lost or caused to be 
out of order during pickup and/or transportation from the place where the 
documents are stored to the place where the documents are to be scanned. 
With typical digitization business processes, documents can also be lost or 
caused to be out of order during the time they reside at the scanning location 
and/or during the scanning process itself. 

[0007] Yet another problem associated with typical document imaging 
business processes arises out of the fact that both human and machine error 
may manifest themselves during the process of scanning of physical 
documents. As a result, physical documents to be scanned can be lost, never 
scanned, scanned out or order and/or improperly scanned. Because of this 
problem it is generally not possible to validate the integrity of the scanned 
documents, their contents or their ordering. The inability to validate sets of 
imaged documents to a particular level of probability can, in turn, lead to 
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situations in which the imaging process may not be applicable for a particular 
need. 

[0008] For example, in the context of litigation, if document imaging was 
not originally done according to a process with a sufficient level of integrity 
verification, then difficulties may arise in connection with how a court treats 
the available evidentiary universe. Similarly, verification of document 
integrity can be a concern when documents are specifically imaged after the 
fact for the purposes of litigation. Imaging processes may also be unusable or 
suspect in other cases such as in the context of imaging, storing and 
cataloguing vital records such as birth certificates, passports as well as various 
other governmental and commercial vital records. 

[0009] Document integrity in connection with an imaging process is of 
even more vital concern in the case where the source documents are destroyed 
following imaging. Often times, imaging is performed for the primary 
purpose of consolidating space and physical storage requirements. In this 
case, documents are typically destroyed or, at least, stored off-site in a 
relatively inaccessible form following digitization. In this case, electronic 
document integrity is even more critical since the source documents no longer 
exist or are difficult to retrieve. 

SUMMARY OF THE INVENTION 

[0010] It is therefore a primary object of the present invention to provide 
a business process and methodology which improves upon prior art 
methodologies and their related drawbacks as described above. 
[0011] It is another object of the present invention to provide a business 
process and methodology which offers a higher level of assurance of 
document integrity resulting from the process. 
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[0012] It is yet another object of the present invention to provide a 
business process and methodology which includes inventory control such that 
the status of physical documents may be tracked during imaging. 
[0013] It is a still further object of the present invention to provide a 
business process and methodology which ensures that all inventoried 
documents are imaged as a result of the process. 

[0014] It is a further object of the present invention to provide a business 

process and methodology which ensures that documents from different 

scanning stations are merged during the scanning process. 

[0015] It is a yet further object of the present invention to provide a 

manageable system for imaging, tracking, storing and retrieving documents 

which are organized with a large number of hierarchical levels. 

[0016] It is a still further object of the present invention to provide a 

document imaging control system which operates according to the 

methodologies described above 

[0017] These and other objects of the present invention are obtained 
through the use of a novel business process which is initiated with the pickup 
and transport of documents to a scanning facility. Following pickup and 
transport, a number of additional steps are performed toward the goal of 
digitizing all documents without the loss of any documents and with assurance 
that all documents are properly catalogued, tracked and scanned. Various 
aspects of the process are designed to provide the highest possible assurance 
of document integrity. 

[0018] The system and methodology of the present invention operate to 
track the status of physical documents during the course of digitization in 
order to assure document integrity. Further, the system and process herein 
disclosed assures proper merging of documents which may be scanned in 
parallel and viamultiple scanning stations. In this way it is also possible to 
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track, organize and scan documents associated with multiple projects in 
parallel and at one or more scanning stations. 

[0019] Additionally, according to the process of the present invention, 
hierarchical organization of documents can be handled regardless of the 
number of levels required. Such multiple level document images are 
translated into a one dimensional linear directory structure filing system for 
easier access and more efficient quality control capabilities. 
[0020] Additional capabilities associated with the document imaging 
control system of the present invention include the ability to easily customize 
the system for use by various users according to their individual preferences 
and requirements. The system of the present invention may be deployed in a 
variety of environments including as an application service provider (ASP) 
solution, in a client server network or as a standalone software package. 
[0021] These and other advantages and features of the present invention 
are described herein with specificity so as to make the present invention 
understandable to one of ordinary skill in the art. 



BRIEF DESCRIPTION OF THE DRAWINGS 

[0022] Figure 1 is a block diagram illustrating the components of the 

system of the present invention in a preferred embodiment; 

[0023] Figure 2 is a diagram illustrating an exemplary set of hierarchical 

levels within a set of documents to be processed for imaging; 

[0024] Figure 3 is an exemplary case level separator barcode which is 

used according to the process of the present invention in a preferred 

embodiment thereof; 

[0025] Figure 4 is an exemplary level 1 separator barcode which is used 
according to the process of the present invention in a preferred embodiment 
thereof; 
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[0026] Figure 5 is an exemplary level 2 separator barcode which is used 
according to the process of the present invention in a preferred embodiment 
thereof; 

[0027] Figure 6 is an exemplary level 3 separator barcode which is used 
according to the process of the present invention in a preferred embodiment 
thereof; 

[0028] Figure 7 is an exemplary leaf separator barcode which is used 
according to the process of the present invention in a preferred embodiment 
thereof; 

[0029] Figure 8 is a process flowchart illustrating the overall digitization 
process of the present invention; and 

[0030] Figure 9 is an example of a tracking sheet created during the 
CHECK-IN step of the digitization process of the present invention. 

DETAILED DESCRIPTION OF THE INVENTION 

[0031] The present invention for document imaging and management is 
now described. The present invention comprises a system for document 
imaging and management as well as a process therefor. In the description that 
follows, numerous specific details are set forth for the purposes of 
explanation. It will, however, be understood by one of skill in the art that the 
invention is not limited thereto and that the invention can be practiced without 
such specific details and/or substitutes therefor. The present invention is 
limited only by the appended claims and may include various other 
embodiments which are not particularly described herein but which remain 
within the scope and spirit of the present invention. 

[0032] Fig. 1 is a block diagram of an architecture which may be used in 
connection with the implementation of the present invention. Although this 
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figure and the following description reflect an embodiment in which the 
process control of the present invention is web-based or in which the process 
control is accessed via some other network connection, the present invention 
is not necessarily limited thereto. As discussed above, other embodiments are 
possible including a case in which the control process of the present invention 
is available on a standalone basis such as a standalone software program. 
[0033] Turning now to Fig. 1, a description of the imaging system 100 of 
the present invention and the specific components included thereon is now 
provided. Network 120 provides communication between some or all of the 
other devices within system 100. Network 120 can take one of many forms 
including, for example, a local area network (LAN), a wide area network 
(WAN), the internet or other communication networks now known or 
hereinafter developed. Communication via network 120 may occur either 
through physical wiring or through a wireless communication protocol or 
both. One or more terminals (T) 1 10 are associated with the system and 
communicate with other components via network 120. These terminals 
preferably include input and output capabilities including a keyboard, a 
mouse, a display screen and a local processor. As such, terminals 1 10 may be 
personal computers or alternatively, they may be dumb terminals without 
processing capability. Each of terminals 1 1 0 may take a different form or 
they may be the same. 

[0034] System 100 further preferably includes one or more scanners 160 
which read images and text on physical media and convert them to electronic 
form. These scanners are preferably capable of scanning a variety of media 
such as paper documents, microfiche, microfilm and other formats now 
known or hereinafter developed. Multiple scanners 160 capable of reading the 
same format type may be included within system 100 to increase throughput. 
System 100 also preferably includes at least one barcode reader 140 which is 
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capable of reading barcodes or other printed codes and transmitting them to 
process control 170. 

[0035] Process control 170 is at the heart of system 100 and functions to 
control the overall digitization process including the novel aspects of such 
process as described in more detail below. In one embodiment, process 
control system may comprise one or more software programs running on a 
server or other computer. Process control 1 70 issues commands to various 
devices through network 120 and receives information concerning device and 
process status through network 120. Users may customize and/or otherwise 
interact with process control 1 70 via one or more of terminals 1 1 0. 
[0036] System 100 may further include labeler 150 which generates labels 
including bar codes or other coding schemes. Such labels are used in 
connection with the overall novel process of the present invention as 
described in more detail below. Various output devices 130 may also be 
included. These devices may be used to write digital output resulting from 
scanning onto various media such as tape, CD-ROM, floppy disk and other 
storage media. Once created, this output media may be physically stored 
and/or delivered to one or more customers. 

[0037] System 100 may also include one or more storage devices 1 80 
which may also be employed to store digital output from scanning. Storage 
devices 180 may comprise various devices such as personal computers, 
servers, disk drives, RAID storage arrays as well as other devices capable of 
storing relatively large amounts of data. Given appropriate capability for 
cataloguing and systematically accessing data stored on storage devices 1 80, 
users may access such data without the need for any additional physical 
storage media through the use of one or more of terminals 1 10. 
[003°] Now that a description of a possible architecture for the system of 
the present invention has been provided, and with reference to Fig. 2, a 
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discussion of hierarchical levels associated with a typical document 
digitization process is provided. In addition, the following discussion 
provides information regarding the way in which the system and method of 
the present invention can handle multiple hierarchical levels in a manner such 
that quality control can be easily, rapidly and accurately effected. 
[0039] The present invention can handle document imaging tasks that 
involve a large number of hierarchical levels. In many situations, a number of 
hierarchical levels are associated with an imaging task. The highest level is 
typically referred to as a project. Some projects involve the imaging of many 
cases. Cases may alternatively be referred to as files or records. In the 
example of a case, such as an ongoing matter being litigated, the case may 
include many categories of documents, and each category may be associated 
with many documents. 

[0040] In the following example, three levels of documents are involved. 
It will be understood by one of skill in the art that various document sets may 
involve a larger or smaller number of levels and the system and method of the 
present invention may operate in all such cases. The following example is 
merely exemplary and for the purpose of generally illustrating a hierarchical 
structure which may be associated with a set of documents processed 
according to the teachings herein. 

[0041] In this example, the first level is referred to as the case level, the 
second level is referred to as the category level, and the third level is referred 
to as the document level. The innovative system of the present invention can 
handle unlimited levels of hierarchies. As discussed below in more detail, the 
unique process of the present invention involves the use of separator sheets. 
Every separator sheet used has a barcode on it. The barcode, which serves as 
the identification of a new starting point, may delineate a new case, a new 
category, or a new document. To the extent additional or alternative levels 



ATTORNEY DOCKET ECO-001 10 



PATENT 



exist, separator sheets are used to delineate the starting point for each new 
instance of an item falling under that level. 

[0042] In a preferred embodiment of the present invention and according 
to the example being discussed, the formats of the barcodes are defined as 
follows: 

Project ID 

The Project ID is usually a unique ID identifying a project. The Project ID can 
be a number or a string. 

Case ID 

The Case ID is usually a unique ID identifying a case. It is usually an 
identification number or string, such as a case number, social security number, 
etc. 

Case Level Barcode 

The Case Level Barcode is defined as follows: 
PREFIX - PROJECT ID - CASE ID 

Where PREFIX and PROJECT ID are optional. The PREFIX is usually used 
to make the Case Level Barcode unique from all possible barcodes in 
documents, so the system can identify the Case Level Separator with no 
mistakes. The Case Level Barcode is unique for each case. 

Level Separator Barcode 

The Level Separator Barcode is used to separate a group of papers, which can 
be a category or document. It does not have to be unique. It can be the same 
for the same level of group of papers. It is defined as follows: 

PREFIX - LEVEL INDICATOR - OPTION 

Where the LEVEL INDICATOR is a sequence number or other protocol 
which specifies the level of the current group of papers, for example, 1 
specifies the level right under the case level, 2 specifies the level under level 
1, and so on. The PREFIX and OPTION fields are optional. The PREFIX is 
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usually used to make the Level Separator Barcode unique from all possible 
barcodes in documents, so the system can identify the Level Separator with no 
mistakes. OPTION may be used to make the Level Separator Barcode 
optically understandable and meaningful. 

Leaf Barcode 

The Leaf Barcode may be used to specify that a level has no further levels 
under it. In a preferred embodiment, it has following format: 

LEAF 

The barcode does not have to be "LEAF". It can be any unique symbol which 
is understood by the system as being indicative of the last level in the paper 
group hierarchy. The Leaf barcode is optional. When used, the preparation 
work may be easier because it is a barcode which can be repeatedly used. 

The Case Level Barcode, Level Separator Barcode, and Leaf Barcode are also 
collectively called Separator Barcodes. 



[0043] By way of example and with specific reference to Fig. 2, assume 
that a project is to image all documents associated with a personal injury 
litigation. Specifically, a case between Smith and Jones is in the process of 
imaging preparation. In this example, using no PREFIX, the PROJECT ID is 
defined as PIL0001. The Case ID is S v J 1414, where 1414 is the unique 
docket number assigned by the court. Further assume that under this case 
there are various groupings of documents, the first being "correspondence", 
the second being "background information" and the third being "pleadings". 
Those three groups represent the first level under case level, and are referred 
to as categories. Finally assume that there are three letters under 
"correspondence" - A, B and C - that there are 2 brochures under 
"background information" - D and E - and that there are 4 pleadings under 
"pleadings" - F, G, H and I. Each of the lettered documents represents the 
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document level, which is level 2. Finally, assume that for pleading G, there is 
a cover sheet, a complaint and a signature page. Each of these aspects of 
document G represents a lower hierarchical level below the document level. 
[0044] The barcodes for this project, specifically for the Smith v Jones 
case, are as follows: 

The Case Level Barcode, which used to identify Smith and Jones' personal 
injury litigation case, is 

PIL0001-SvJ1414. 

An example Case Level Barcode for the Smith v. Jones case is provided in 
Fig. 3. 

The Level Separator Barcodes for level one, the category level, are 
respectively: 

1-CORRESPONDENCE 

1-BACKGROUD 

1-PLEADINGS 

without using PREFIX. An example of a Level Separator Barcode for level 
one, particularly the one for "Pleadings" is provided in Fig. 4. 

The Level Separator Barcode for level two, the document level, is 

2-DOCUMENT 

without using PREFIX. An example of a level 2 Level Separator Barcode for 
"Document" is provided in Fig. 5. 

The Level Separator Barcodes for level three, the letter document level, are 
respectively: 

3-COVER 

3-COMPLAINT 

3-SIGNATURE 

without using PREFIX. An example of a Level Separator Barcode for level 3 
particularly the "Complaint" separator is provided in Fig. 6. 
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No Leaf Barcode is used in this project A Leaf Barcode could, however, be 
used for documents A, B, C, D, E, F, H, and I, since they have no further 
group of papers underneath. An example of a Leaf Barcode is provided in 
Fig. 7. 

[0045] In order to scan multiple-level prepared documents into a one 
dimensional directory structure in filing system according to the present 
invention, the use of a separator sheet with a barcode defined as above will 
realize the one-dimensional structure. Because of the one-dimensional- 
directory structure, the imaging system of the present invention has a simple 
filing structure so that system recovery, problem tracking, and system 
maintenance become easier and more efficient. 

[0046] The process of the present invention proceeds according to the 
following algorithm in a preferred embodiment in order to realize the one- 
dimensional filing structure: 



DEFINE Current__String AS A STRING 
DEFINE Current_Level AS INTEGER 

IF the system DETECTS a barcode 
THEN 

IF the barcode IS a separator barcode 
THEN 

IF the barcode IS a Case Level Barcode 
THEN 

CurrentString = Case Level Barcode 
SAVE Current_String AS Case Level String 
CurrentLevel = Case Level 
ELSE 

IF NOT Leaf Barcode 
THEN 

CurrentLevel = LEVEL INDICATOR of Level Separator Barcode 

ELSE 

Current_Level = LEAF 
END IF 

CurrentString = (Current_Level - I ) String 
IFCurrentJLevel-1 =0 
THEN 

CurrentLevel - 1 = Case Level 
END IF 

Current_String = Current_String + the barcode + Sequence # 
END IF 

CREATE a directory using the CurrentjString 
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END IF 
END IF 

[0047] In order to reach 100% no-missing-scanning and scan-in- sequence 
quality control (QC) accuracy, the process of the present invention according 
to a preferred embodiment thereof places a unique sequence number on each 
page of document before scanning. This number confirms the pages, the 
sequence of the pages involved in a document and the box they belong to. The 
system may use Optical Character Recognition (OCR) to read the number on 
each image and point out the place in the image batch where any out-of-page- 
sequence, missing number, odd number, or other suspicious problems are 
found. The system can then interactively work with operator to make sure the 
problems are resolved. 

[0048] In order to automatically merge all images scanned at different 
scan stations into one document, the innovative procedure places a 
corresponding indicator page for each special document page or pages (if the 
pages are in a group sequentially together) within a document. Whenever such 
an indicator page is detected, the image file name for the indicator page will 
be the regular image file name appending with the indicator page barcode plus 
a in between. For example, when a drawing indicator page is detected, this 
drawing page has a barcode as D0001, the regular image file name is 
00000003.TIF, then the file name for this indicator page will be 00000003- 
D0001.TIF. 

[0049] The system will look into a file or files scanned from a wide 
format scanner station under a filing directory named D0001 because the 
folder name of such drawing has a barcode D0001 attached. The system then 
appends all the files under filing directory, D0001, with a designator resulting 
in files named as D0001-0001.TIF, D0001-0002.TIF, , respectively. 
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These files are placed after the 00000003 -D0001 .TIF file. The 00000003- 
D0001.TIF will not be removed until the image Quality Control (QC) process 
is done because the 00000003-D0001 .TIF serves as an indicator of QC point 
as discussed in more detail below. 

[0050] Referring now to Fig. 8, the process for digitizing data while 
obtaining the benefits described above is now described. In the first step 
(PICKUP) illustrated by block 205, a number of sub-steps are undertaken. 
Prior to documents actually being picked up, a document package list (DPL) 
is requested and received from the document owner. The DPL generally 
contains an entry for each container holding one or more documents as well as 
a description of the container and/or a container number. Individual 
documents not held by a container are also itemized on the DPL. 
[0051] During the PICKUP step 205, the DPL is verified against the 
physical documents and containers. A unique barcode is issued and attached 
to each document container and each uncontainerized document and the 
barcode is entered into a recording device such as a palmtop, barcode reader, 
or other devices. As alluded to above, labeler 150 may be used to generate the 
barcode labels. These barcode labels are read and entered into system 1 00 via 
barcode reader 140 or some other device such that system 100, and in 
particular, process control 170 becomes aware of all document containers 
containing documents to be digitized as well as the corresponding coding for 
each such container or uncontainerized document. 

[0052] Both barcode reader 140 and labeler 150 may be portable devices 
such that they may be easily brought to the facility initially housing the 
documents. In a preferred embodiment of the present invention, these devices 
may have a wireless communication link to network 120 and thereby system 
1 00 for data collection and for importing such data into system 1 00. PICKUP 
step 205 also proceeds such that all documents and containers are sealed and 
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transported if the digitization is to take place off of the document owner site. 
In this case, the documents and containers are next loaded onto a transport 
vehicle and transported to a digitizing facility or some other location wherein 
digitization is to take place. 

(0053] The next step in the process of the present invention is CHECK-IN 
step 210. In this step, when the documents and containers arrive at the 
digitization facility, the DPL is first be imported into system 100. This may be 
accomplished manually via one of terminals 1 10 or through some automated 
process such as a coded DPL which may be read into system 1 00 without 
human intervention. Next, each time a container or noncontainerized 
document is brought from the transport vehicle into the digitization facility the 
barcode label previously placed thereon is scanned. This step verifies that 
each container and noncontainerized document which was picked up from the 
original document location has been successfully transferred to the 
digitization facility. 

[0054J CHECK-IN step 210 may also include the sub-step of verifying 
and recording the cases contained in each container. Alternatively, this sub- 
step may be accomplished during the PICKUP step 205. However, in most 
situations it is not practical to do inventory the individual cases in each 
container at the original site due to pickup time, space and financial 
constraints. It is therefore preferable that individual cases be itemized at the 
digitization facility. It will be understood by one of skill in the art that when 
referring to "cases" as above, one is referring to an organizational aspect of 
document indexing. For example, a single container such as a box may 
contain a series of files which each of the files containing a number of papers. 
Although each of these files is contained in the same container/box, it does not 
necessarily mean that each of the files is associated with the same case. 
Continuing with the example, file #1 in container A may relate to a litigation 
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between X and Y while file #2 in container A may relate to a litigation 
between P and Q. Based upon labeling of the files or review by an individual 
with the requisite knowledge to do so, files within a specific container may be 
assigned to one or more cases and such assignment is input into system 100. 
[0055] After the detailed inventory verification associating documents 
within containers to specific cases as described above, a notification may be 
sent to the document owner which such notification serves to report the actual 
inventory received at case-level. This notification may be automatically 
generated by process control 170 upon completion of the detailed case level 
inventory. 

[0056] Thus, CHECK-IN step 210 effects integrity verification at box- 
level by verifying the number of containers checked in. In addition, CHECK- 
IN step 210 records the specific cases associated with the container. For each 
box checked in, the following information, for example, may be recorded: 

• Project Name 

• Owner Information Including Name, Address, etc. 

• The Number of Cases 

• The Identification for Each Case 

• The Transportation Personnel Information 

• Check-In Site Information 

• Check-In Time/Date Stamps 

• Check-In Personnel Information 

• Assigned Box ID 

• Comments 

[0057] Other and/or alternative data may be required by and/or otherwise 
supplied to system 100 in connection with the detailed case inventory sub-step 
of CHECK-IN step 2 1 0. The next aspect of CHECK-IN step 2 1 0 calls for the 
generation and printing of tracking sheets with unique barcode information. 
The number of tracking sheets printed out depends up the number of 
preparation steps to be conducted (see below) plus an additional tracking sheet 
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for overall box tracking purposes. This is because each step requires one 
operator to complete and each operator preferable signs off on a tracking sheet 
specifically designed and printed out for him/her. Therefore, at the end of the 
day, a supervisor can gather all the tracking sheets from different operators at 
different operation stages and generate a report and production statistics. 
Further, a tracking sheet is preferably used for overall status tracking of a box 
and operations on the box. Thus, for example, if preparation (see below) 
involves document preparation and labeling, the number of tracking sheet 
printed out will be three (3). The tracking sheets are placed into the box 
associated with each tracking sheet. Thus, in the above example, three 
tracking sheets would be placed in each box with each tracking sheet 
containing the specific barcode associated with the specific box. 
[0058] The next step in the process is PREPARATION step 215. 
PREPARATION step 215 includes the sub-steps of major document 
preparation and integrity treatment at the case level and at the document level. 
The document preparation includes procedures to make each document pages 
scannable. For instance, the document preparation may involve staple 
removal, document repairing, putting small object documents on regular 
paper, etc. The case-level integrity treatment is accomplished by creating 
separator barcode sheets to group documents within a case and by using a 
special document folder methodology a discussed below. The case-level 
integrity treatment is performed to guarantee the integrity of a case when 
documents in the case are separated and scanned at different scan stations. 
Examples of such a separator sheets are provided in Figs. 3-7. 
[0059] An operator who prepares the document for imaging should be 
trained to be able to recognize what is a case, what is a document and what 
category a document belongs to, as well as having some additional knowledge 
concerning how papers should grouped in a case. When a case starts, the 
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operator performing document preparation places the corresponding Case 
Level Separator as the leading page of the case documents. In the case of the 
example discussed above, when the operator recognizes the start point of the 
Correspondence group of paper, he/she places a Level 1 Separator - 
Correspondence as the leading page of the group. Inside the group, where a 
document is identified, the operator places a Level 2 Separator - Document as 
the leading page of the group of documents. There is no difference which 
group, Correspondence, Background, or Pleadings, is identified first the case. 
In other words, the operator is required only to place a corresponding 
separator barcode sheet as a leading page of the corresponding group of 
papers recognized. Information on a separator sheet can be anything that is 
visually noticeable and meaningful. The only mandatory element is the 
barcode. For example, in the barcode examples of Fig. 3-7 and discussed 
below, there is information about the level of the separator and the project 
name. 

[0060] The document-level integrity treatment undertaken during the 
PREPARATION step 215 calls for placing a unique label with a sequence 
number on each page of each document (PAGE ID label). The sequencing 
may be based upon any reasonable methodology including, for example, Bates 
numbers or other schemes that follow a numerical sequence such that each 
page is associated with a unique document number. The case-level and 
document level treatments described above including the generation of 
separator sheets and unique document number labels, respectively, ensure that 
the pages and sequence of the pages can be captured in digital form without 
losing information associated with the documents in their original form and 
their original position, i.e. document and case integrity. 
[0061] The document integrity aspect of PREPARATION step 2 1 5 can, in 
some cases, require the treatment of variety of type of documents, such as 
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microfilm, microfiche, X-ray film, wide format drawing, etc. Each of these 
special document or page(s) (i.e. non-paper based documents) is placed into a 
special document folder and located according to its original sequence along 
with the documents before and after it. The special document folder is then 
labeled with a unique PAGE ID barcode. The PAGE ID barcode for non- 
paper documents may contain one or more special fields identifying the nature 
of the document material, such as 

• F to specify microfiche 

• M to specify microfilm 

• X to specify X-ray film 

• D to specify wide format drawing 

[0062] In addition to placing a PAGE ID on a special document folder, it 
is preferable that a special barcode page with the same barcode as the PAGE 
ID be inserted into the paper document sequence (outside of the special 
document folder, either immediately before or immediately after the folder) 
wherever the special document page is exactly located. After finishing 
preparation of an entire box, an operator places an Operator ID Barcode label 
on one of the tracking sheets for that box and the operator checks a box on the 
tracking sheet to indicate that the preparation task has been done. 
[00631 The next step in the digitization process of the present invention is 
the PREPARATION AUDIT step 220. This step involves the auditing of all 
information about each box. In connection with this step, an operator verifies 
whether or not the separator sheets are all matched with cases as well as the 
continuity of the barcode sequence of the separators. Additionally, the audit 
step sets up the sequence or positions of special documents in a case. 
Therefore, it guarantees the integrity of the case documents scanned at 
different scan stations. In connection with this step 220 the following 
exemplary information may be collected: 
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• Name of Person Who Performed the Audit 

• Date/Time Stamp of the Audit 

• Comments 



[0064] The DIGITIZATION step 225 is next. In this step, all documents 
are scanned box-by-box. In order to scan all pages in a box, the box may need 
go through different scanners either to increase throughput or because a box 
contains more than one document format. For example, if a document 
includes both paper pages and microfiche, the box containing this document 
must be scanned using a paper scanner for digitization of the papers, and then 
go to microfiche scanner for scanning of the microfiche. According to the 
teachings of the present invention, either format may be scanned first or they 
may be scanned in parallel. Thus, it is possible with the novel process of the 
present invention to physically separate the source documents for scanning 
with the knowledge that they will be properly ordered in digital form. 
Because all special format pages are registered during PREPARATION 
AUDIT step 220, it is not necessary to define which scanner must be used 
first. 

[0065] The next step in the process is the BATCH INTEGRATION step 
230. During this step, special images of special document pages are matched 
with their corresponding special barcode pages' images inserted during 
PREPARATION step 215, and each special barcode page image is appended 
with the corresponding image of the special document page. 
[0066] The next step in the process is the IMAGING QUALITY 
CONTROL (QC) step 235. This step ensures that the image quality of the 
scans meet the requirements dictated by the customer. Also, during this step, 
the operator ensures the integrity of document images, that no scans were 
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missed, and that no documents were mis-ordered. In case of any problems, 
rescanning is performed. 

[00671 Following the IMAGING QC step 235, the QC AUDIT step 240 is 
performed. The QC AUDIT step 240 sets and enforces quality control rules 
including Optical Character Recognition (OCR) operation, which detects 
labels attached to each document page. The QC AUDIT step 240 interactively 
drives the QC process to verify the validation of the IMAGING QC. The OCR 
operation will look for a label on each image which was placed on the 
corresponding document page during the labeling process. It then reports to 
imaging QC operators any possibilities of missing label and out-of-sequence 
conditions and requires the operator to take actions of verification, validation, 
and correction. 

[0068] The next step is the IMAGE DELIVERY step 245. In this step, 
output devices 130 under the control of process control 170 act to convert 
digitized images for storage on different physical media and into the formats 
required by the customer. Also, the images may be made ready to import into 
other systems for use by those systems or to deliver the images via CD-ROM, 
DVD, tapes, and other electronic media. During this step, all special barcode 
images added into case for imaging preparation may be removed. 
[0069] The CHECK-OUT step 250 is next. During this step, an operator 
prepares the boxes for either return to their owner, off-site storage or 
destruction . Following this step is the RECEIPT SIGN-OFF step 255. This 
step is performed if the boxes need to be sent back to their owner or to some 
other location. This step includes box transportation and sign-off for receipt of 
the boxes. Finally, DESTRUCTION CERTIFICATE step 260 may be 
performed. During this step, if applicable, all papers in all the boxes are 
destroyed, and a certificate is issued for each box destroyed. 
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[0070] Any paper created by the system, such as tracking sheets, log 
sheets, and reports preferably have special unique LOG PAGE barcodes 
printed thereon. With the barcode, the system is able to detect any system 
generated paper scanned, which avoids any irrelevant document scanning by 
mistake. Illustrated in Fig. 9, two LOG PAGE barcodes are printed on a box 
Tracking Sheet. They are printed in portrait and landscape respectively to 
make sure the sheet is detected if scanned into system in both directions. 
[0071] The foregoing disclosure of the preferred embodiments of the 
present invention has been presented for purposes of illustration and 
description. It is not intended to be exhaustive or to limit the invention to the 
precise forms disclosed. Many variations and modifications of the 
embodiments described herein will be apparent to one of ordinary skill in the 
art in light of the above disclosure. The scope of the invention is to be defined 
only by the claims, and by their equivalents. 
[00721 What is claimed is: 



